Identifying distantly related protein sequences

نویسنده

  • William R. Pearson
چکیده

The most powerful method available today for inferring the biological function of a gene (or the protein that it encodes) from its sequence is similarity searching on protein and DNA sequence databases. With the development of rapid methods for sequence comparison, both with heuristic algorithms and powerful parallel computers, discoveries based solely on sequence homology have become routine. Indeed, the vast majority of the gene identifications in the recent descriptions of the Haemophilus influenzae (Fleischmann et ai, 1995), Mycoplasma genitalium (Fraser et ai, 1995), yeast (Dujon, 1996) and Methanococcus janesscii (Bult et ai, 1996) genomes are based only on protein sequence similarity. As more complete genomes become available, protein sequence comparison will become an even more powerful tool for understanding biological function.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CODEHOP (COnsensus-DEgenerate Hybrid Oligonucleotide Primer) PCR primer design

We have developed a new primer design strategy for PCR amplification of distantly related gene sequences based on consensus-degenerate hybrid oligonucleotide primers (CODEHOPs). An interactive program has been written to design CODEHOP PCR primers from conserved blocks of amino acids within multiply-aligned protein sequences. Each CODEHOP consists of a pool of related primers containing all pos...

متن کامل

BLISS 2.0: a web-based tool for predicting conserved regulatory modules in distantly-related orthologous sequences

UNLABELLED BLISS 2.0 is a web-based application for identifying conserved regulatory modules in distantly related orthologous sequences. Unlike existing approaches, it performs the cross-genome comparison at the binding site level. Experimental results on simulated and real world data indicate that BLISS 2.0 can identify conserved regulatory modules from sequences with little overall similarity...

متن کامل

INTERALIGN: interactive alignment editor for distantly related protein sequences

SUMMARY Improving and ascertaining the quality of a multiple sequence alignment is a very challenging step in protein sequence analysis. This is particularly the case when dealing with sequences in the 'twilight zone', i.e. sharing < 30% identity. Here we describe INTERALIGN, a dedicated user-friendly alignment editor including a view of secondary structures and a synchronized display of carbon...

متن کامل

Increased detection of structural templates using alignments of designed sequences.

Protein structure prediction by comparative modeling benefits greatly from the use of multiple sequence alignment information to improve the accuracy of structural template identification and the alignment of target sequences to structural templates. Unfortunately, this benefit is limited to those protein sequences for which at least several natural sequence homologues exist. We show here that ...

متن کامل

Genetic Analysis of Three Structural Proteins in Iranian Infectious Bronchitis Virus Isolate

Infectious bronchitis virus (IBV) is a contagious pathogen in fowl that results in economic loss in the poultry industry. In this study, the amino acids sequences of three structural proteins M, N, and S1 for five Iranian IBV isolated during 1998-2011 have been analyzed. Conserved and variable regions, hydrophobic characteristics and identity matrix were determined after alignment by Bioedit ve...

متن کامل

A Space-Efficient Approach towards Distantly Homologous Protein Similarity Searches

Protein similarity searches are a routine job for molecular biologists where a query sequence of amino acids needs to be compared and ranked against an ever-growing database of proteins. All available algorithms in this field can be grouped into two categories – either solving the problem using sequence alignment through dynamic programming, or, employing certain heuristic measures to perform a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer applications in the biosciences : CABIOS

دوره 13 4  شماره 

صفحات  -

تاریخ انتشار 1997